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Abstract — The Fault Tree Analysis shows the possible 
causes of a system malfunction by enumerating the suspect 
components and their respective failure modes that may 
have induced the problem. The complex systems often use 
fault trees to analyze the faults. Fault diagnosis, when it 
occurs, is performed by engineers and analysts performing 
extensive examination of all data gathered during the 
mission. International Space Station (ISS) control center 
operates on the data feedback from the system and decisions 
are made based on threshold values by using fault trees. 
Since those decisions making tasks are time critical and 
must be done promptly, the engineers who manually analyze 
the data are facing the time challenge. To automate this 
process, this paper present an approach that uses decision 
trees to capture the contents of fault trees and detect fault by 
running the data through the decision trees in real time. 
Decision trees (are also called classification trees) are the 
binary trees built based on data, it can classify the objects to 
different classes. In our case, the decision trees can classify 
different fault event or normal event. Given a set of data 
samples, decision trees can be built and trained, and then by 
running the new data through the trees, classification and 
prediction can be made. In this way, diagnostic knowledge 
for fault detection and isolation can be represented as 
diagnostic rules; we call this tree the diagnostic decision 
trees. By showing the fault path in decision tree, we also 
can point out the root cause when a fault occurs. Since all 
the procedures and algorithms are available to build decision 
trees, the trees built are cost effective, time effective. 
Because of the diagnostic decision trees are based on data 
and previous knowledge of logic, the DDT can also be 
trained to predict fault, detect unknown fault. Based on this, 
the needs for on board or service bay, real time oriented 
diagnostics can readily be met. Diagnostic Decision Trees 
are built based on the fault trees as static trees that service as 
the fundamental diagnostic trees. And the dynamic DDTs 
are built over time from the operation data. The dynamic 
DDT will add the functionalities of prediction, and will be 
able to detect unknown fault. Crew or maintenance 
engineers can use the decision tree system without having 
previous knowledge or experience about the diagnosed 
system. To our knowledge, this is the first paper to propose 
a solution to build diagnostics decision trees from fault tree. 


which convert the reliability analysis models to diagnostic 
models. We show through mapping and ISS examples that 
the approach is feasible and effective. We also present 
future work and development. 
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1. Lntroduction 

Fault tree concept is developed by Bell Telephone 
Laboratories in 1962 for the U.S. Air Force for use with the 
Minuteman system. It was later adopted and extensively 
applied by the Boeing Company, is one of the most widely 
used methods in system reliability analysis for a long time 
[3], It is a deductive procedure for determining the various 
combinations of hardware and software failures, and human 
errors that could result in the occurrence of specified 
undesired events (referred to as top events) at the system 
level. As part of the analysis, the minimal cut sets of a fault 
tree can be determined [2]. And then fault tree can be built. 
Individual fault tree can be visualized and draw'. Fault trees 
are usually individually built for each part of the system for 
each top event. It is hard to have generic software to traverse 
fault trees. In the other hand, the decision trees are matured 
data structure and it is very easy to manipulate in a software 
program. Using decision tree to represent fault tree will 
increase the operability and decrease response time for 
system diagnostic, and furthermore, its visualization will 
make users easier to see the root cause of the fault and path 
from which the fault came. As the high availability of many 
different tree algorithms implementations in computer 
science field, using decision tree to manage the fault tree no 
doubt is one of best consideration. In this paper, we present 
a method to convert existing fault trees to decision trees. 
More general way of constructing decision tree is presented. 
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The method is easy to be programmed and run on a 
computer since the decision tree algorithms has many 
available implementations [4], This method also provides a 
good tool for researchers on simulation and prediction tasks. 
By using this method, one can analyze the data samples 
from the past and categorized them into different classes: 
abnormal, normal, and fault event in such a way that future 
fault can be predicted from the past. This could be a data 
mining and run time updating. Such artificial intelligent 
application is presented in the paper as form of framework 
architecture. 

Where this document is silent on a formatting question, it is 
because it is not important or is the writer's option. When in 
doubt make a choice that makes your document readable. 


2. Conversion Method 
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There are some other attempts to represent fault trees by 
other forms; one of them is building diagnostic map from 
fault tree [1], Decision trees are the trees usually, built on 
data. Let’s look at a fault tree and see how can we map it to 
the decision trees. Take a sample fault tree in the form of 
following: 


Voltage A Over Trip set point 129 


Validity 0(1 = mvalid) 

3 consecutive readings - 


Reconfig. deactive 1(0= active) 


Figure 2 Decision Tree for Over Voltage 

To show more common case that includes OR gate in the 
fault tree, the other example is shown here: 
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Validity 0(1 = invalid) • 
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Figure 3 GNC Fail Event Fault Tree 


Figure 1 Over Voltage Event Fault Tree 

We have 6 inputs to the trees. The inputs remain the same 
for decision tree. The corresponding decision tree should 
have the same functionality in terms of samples inputs and 
fault triggers. In the other words, the same inputs to the fault 
tree, or to the decision tree will have the same result. The 
corresponding tree can be built as in Figure 2. If the data can 
reach all the way to terminal node 6, we have known that the 
system is at over voltage fault. The decision tree not only 
provides the final result if the system is at fault status, it can 
also provides the interim status by looking at where the 
sample data end up to. 


In this case, more balanced data inputs will end up with 
more evenly distributed decision tree. In a same way as we 
showed earlier, the corresponding decision tree is presented 
as follows: 
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Figure 4 Decision Tree for GNC Fail 
We had demonstrated that fault trees can be shown and 
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mapped to decision trees with examples of the over voltage 
and GNC fail fault trees. The convenient use of decision tree 
is that the available decision tree software program can 
easily pin point out a root cause of an event ( including fault 
event) by recording the edges in the path of the tree when 
giving reports and evaluation of the system status. For more 
general purpose and more easy illustration, we can abstract 
the information into a map and construct a decision tree 
from map. The map basically represents the different events, 
including fault event, in the n dimension space. When we 
deal with decision trees, we call the events classes. The class 
could be fault, nominal, or warning etc. This demonstration 
shown that the decision tree not only can derived from fault 
tree nut also can be construct from data samples, which is 
very useful in the real time fault detection and prediction. 



Figure 5 Classes distributions 

To map to the decision tree, we use an example to explain it. 
We assume faults in the two dimension space for 
simplification of visualization. Multiple dimensions will 
follow the same rules. In Figure 5, we give an example of 
the fault scenarios, the cylinders represent normal, called 
class 1 and cubes represent faults, called class 2. We will 
use cumulative distribution function ( cdf) for tree 
construction. The cdf is the probability that the variable 
takes a value less than or equal to x. That is 

= Pr[X < x\ = cc (j) 

This can be expressed mathematically for continuous 
distribution: 

F i x ) = [ ffaW 

J —oo (2) 

For a discrete distribution, the cdf can be expressed as 

F{ *) = £/(*) 

t=0 (3) 


When we construct a decision tree, we have a root, then we 
have two branches, further, each of those branch can, has 
maximum of two branches, until no further branches, we 
reach the bottom of the tree and we done. Those procedures 
could be said in another way; we are splitting the data until 
it couldn’t split any more. So what we need is where to split, 
and when do we stop splitting. With the method of 
accumulate distribution function, we construct the trees in 
following steps. First, calculate the cdf for each class. 
Second, compare the result for each class in all the points. 
Third, the largest value will be picked and the x will be 
the split point. Repeat first to third step until no more point 
to split. In the example, we calculate the cdf for each class as 
a function of each attribute (see Figure 5), and then pick the 
split point where the difference of the two cdf values is 
maximum. 
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Figure 6 Calculate split point by using cdf{ 1) 



We repeatedly split until all samples in a node are of the 
same class. In Figure 6, the horizontal axis is the possible 
split points S t 1 < / <= 5 corresponding to the x-axis in 
Figure 6, and the vertical axis is the value of the cdf for each 
class. In Figure 6 f x is the cdf value for class 1 (cylinders) 
and f 2 is the cdf value for class 2 (cubes). 

In Figue 5, the vertical axis is the possible split points. 

5, 5 < i <= 10 corresponding to the y-axis in Figure 5, and 
the horizontal axis is the value of the cdf for each class. In 
Figure 6 is the cdf value for class 1 (cylinders) and f 2 
is the cdf value for class 2 (cubes). The purpose is to find 
the point where the distance between f x and f 1 is the 
maximum. To calculate the cdf we used the estimated 
function n i / N t . The total number of samples in class 1 is 

6, and in class 2 is 4. N j = 6 and = 4. As shown in 
Figure 5 at split point S', , we have the f t value of 1/6, and 
f 2 value of 0. At split point S-, we have the f x value of 
2/6, and a f 2 value of 1/4. At split point S 3 we have the 
value no change, still 2/6 or 1/3, and a f 2 value of 2/4 or 
1/2. At split point S< we have the f x value of 1/2 and a f 1 
value of 3/4. At split point S y we have the f x value of 5/6 
and a f-, value of no change, 3/4. 


j 



Similarly in Finnic 7 at Split tile Value of f x is 1/6 and 
f 2 is 0. At split point S 7 we 
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have the f } value of 3/6, and a f 2 value of 0. With the 
same calculations, at split point S s we have the f x value of 
4/6 and a fa value of 0. Again, at split point S 9 we have 
the f x value of 5/6 and a f 2 value of 1/4. At split point 
S 10 , w'e have the f x value of 1 and a f 2 value of 2/4. 

From Figures 6 and 7, we can see that the split has the 
maximum distance (4/6) between f x and f 2 among all 
others. Therefore, we pick the first split point as S x . After 
we split the set on S g , we have two subsets, one of the 

subsets has only class 1 in it. and so we don’t need to do the 
further split on this subset. But on the other subset, we will 
repeat the same calculations on the remaining samples to 
find the further split points. The procedures to calculate the 
cdf and select the maximum distance between f x and 
f 7 are the same as above. The constructed tree is shown in 
Figure 8. 


& 



Figure 8 Final Decision Tree 

When the real time data come in, we let them go through the 
decision tree that we constructed. The faults occurred when 
the data sample fall in fault class at the terminal node. To 
illustrate how the fault happened, we can show the fault by 
tracking the path that the data went through. Visualization 
of the path with the distinctive color or shape will show user 
the clear clue of cause of the fault. 

This method not only can apply to the conversion of the 
fault trees to decision trees, it can also construct decision 
trees from data samples at run time of the operation of ISS 
over time. By simply select a set of data samples from time 
to time, we can built decision trees. In later time, the built 
decision trees can be used to compare the new data and to 
predict future fault. The best use of such trees is to build 
trees by applying grouped fault scenarios. Then applying 
real time data to the tree to compare the pattern, the faults 
can be detected when a pattern is matched. 

T h i s method can not only apply to the conversion of the 
fault trees to decision trees, it can also construct decision 
trees from data samples at run time of the operation of 
International Space Station over time. We can select a set of 
data samples, especially the ones that are representative, 
from time to time and built decision trees for those systems 
by fault scenarios. Data patterns are captured in the tree and 
can be recognized when the future data samples pass 
through the trees. When real time data samples applied to 
the tree, the faults pattern could be recognized and the fault 
could be detected. 

3. Applications 


From above illustration, we can migrate the fault trees to 
decision trees. We also can build decision trees from 
event and data. The decision trees converted from fault 
trees could be used as diagnostic tools when the fault 
happened, run the data through the trees and fmd out 





whcic data stop, in order to find out what fault. Other 
applications are also possible by utilizing decision trees. 
One we know it that the decision tree is very well suit for 
data mining task, we can apply our trees to an data 
mining application targeting at recognizing fault patterns 
and do early fault detection and prediction. A design 
model , a frame work model, is presented in Figure 9. In 
the figure, we can see the decision trees fit into 
knowledge discovery part of the data mining process [6]. 
We have initial decision trees in there for fault diagnostic 
and we have on going decision trees building on real time 
when the system is running. We can build such a tree that 
records fault patterns each time when a fault event 
occurs. Especially, we record the fault trends patterns so 
we can use such tree to recognize fault in its early stage. 


Kmuicdgc Da cover. 



Figure 9 Decision trees in data mining application 


4. Conclusion 

We started from ISS fault trees example to migrate to 
decision trees, presented a method to convert fault trees to 
decision trees. The method shows that the visualizations of 
root cause of fault are easier and the tree manipulating 
becomes more programmatic via available decision tree 
programs. The visualization of decision trees for the 
diagnostic shows a format of straight forward and easy 
understands. For ISS real time fault diagnostic, the status of 
the systems could be shown by running the signals through 
the trees and see where it stops at. The other advantage to 
use decision trees is that the trees can learn the fault patterns 
and predict the future fault from the historic data. The 
learning is not only on the static data sets but also can be 
online, through accumulating the real time data sets, the 
decision trees can gain and store faults patterns in the trees 
and recognize them when they come. 


5. Future Development 

This paper presented the method to migrate the fault trees to 
decision trees, which lays a good foundation for using data 
mining technique in advanced diagnostic system. The next 
step will naturally fall to a project to implement a data 
mining software for fault detection, prediction, and analysis. 
Such software will use the decision trees as an engine inside 


of the diagnostic system application This engine will he 
able to gain knowledge of fault patterns then recognize it. 
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